NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Automated Program Repair: Emerging Trends Pose and Expose Problems for Benchmarks

https://doi.org/10.1145/3704997

Renzullo, Joseph; Reiter, Pemma; Weimer, Westley; Forrest, Stephanie (March 2025, ACM Computing Surveys)

Machine learning (ML) pervades the field of Automated Program Repair (APR). Algorithms deploy neural machine translation and large language models (LLMs) to generate software patches, among other tasks. But, there are important differences between these applications of ML and earlier work, which complicates the task of ensuring that results are valid and likely to generalize. A challenge is that the most popular APR evaluation benchmarks were not designed with ML techniques in mind. This is especially true for LLMs, whose large and often poorly-disclosed training datasets may include problems on which they are evaluated. This article reviews work in APR published in the field’s top five venues since 2018, emphasizing emerging trends in the field, including the dramatic rise of ML models, including LLMs. ML-based articles are categorized along structural and functional dimensions, and a variety of issues are identified that these new methods raise. Importantly, data leakage and contamination concerns arise from the challenge of validating ML-based APR using existing benchmarks, which were designed before these techniques were popular. We discuss inconsistencies in evaluation design and performance reporting and offer pointers to solutions where they are available. Finally, we highlight promising new directions that the field is already taking.
more » « less
Full Text Available
Automatically Mitigating Vulnerabilities in Binary Programs via Partially Recompilable Decompilation

https://doi.org/10.1109/TDSC.2024.3482413

Reiter, Pemma; Tay, Hui Jun; Weimer, Westley; Doupé, Adam; Wang, Ruoyu; Forrest, Stephanie (May 2025, IEEE Transactions on Dependable and Secure Computing)

PRD lifts suspect binary functions to source, available for analysis, revision, or review, and creates a patched binary using source- and binary-level techniques. Al- though decompilation and recompilation do not typically succeed on an entire binary, our approach does because it is limited to a few functions, such as those identified by our binary fault localization.
more » « less
Full Text Available
What can program repair learn from code review?

https://doi.org/10.1145/3524459.3527352

Endres, Madeline; Reiter, Pemma; Forrest, Stephanie; Weimer, Westley (May 2022, International Workshop on Automated Program Repair)

Full Text Available
Improving source-code representations to enhance search-based software repair

https://doi.org/10.1145/3512290.3528864

Reiter, Pemma; Espinoza, Antonio M.; Doupé, Adam; Wang, Ruoyu; Weimer, Westley; Forrest, Stephanie (July 2022, Genetic and Evolutionary Computation Conference, Boston, Massachusetts)

Full Text Available

Search for: All records